57 results found.
Speech
Corpus,
Language Type:
Multilingual
Languages:
Romanian
Availability:
Freely Available
License:
GByte
Size:
1.2 Production Status:
Existing-used
Use:
Speech Synthesis
Paper:
N/A
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
English Romanian
Availability:
Freely Available
License:
<Not Specified>
Size:
9090921 words Production Status:
Existing-used
Use:
Lexicon Creation/Annotation
Paper:
N/A
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
English German Romanian Russian
Availability:
From Owner
License:
<Not Specified>
Size:
2333 sentencesProduction Status:
Newly created-finished
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
Czech Romanian Slovak Spanish Vietnamese
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified> <Not Specified>Production Status:
Existing-used
Use:
Evaluation/Validation
Paper:
N/A
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
English German Romanian Spanish Standard Arabic
Availability:
Freely Available
License:
<Not Specified>
Size:
65-350 words Production Status:
Existing-used
Use:
Evaluation/Validation
Paper:
N/A
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
Bulgarian Czech English Hungarian Romanian
Availability:
From Data Center(s)
License:
ELRA
Size:
75Mbyte Production Status:
Existing-used
Use:
POS Induction
Paper:
N/A
Documentation:
English
Written
Lexicon,
Language Type:
Multilingual
Languages:
English French Italian Portuguese Romanian Spanish
Availability:
Freely Available
License:
CreativeCommons
Size:
900 KByte Production Status:
Newly created-finished
Use:
Lexicon Creation/Annotation
-
Paper title:Automatically Building a Multilingual Lexicon of False Friends With No Supervision
-
Paper track:Terminology/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Ana Sabina Uban | FalseFriendsLexicon | /N |
Documentation:
https://github.com/ananana/false_friends_resource/blob/master/README.md
Written
Treebank,
Language Type:
Monolingual
Languages:
Afrikaans Akkadian Amharic Ancient Greek Arabic Armenian Assyrian Bambara Basque Belarusian Bhojpuri Breton Bulgarian Buryat Cantonese Catalan Chinese Classical Chinese Coptic Croatian Czech Danish Dutch English Erzya Estonian Faroese Finnish French Galician German Gothic Greek Hebrew Hindi Hindi English Hungarian Indonesian Irish Italian Japanese Karelian Kazakh Komi Permyak Komi Zyrian Korean Kurmanji Latin Latvian Lithuanian Livvi Maltese Marathi Mbya Guarani Moksha Naija North Sami Norwegian Old Church Slavonic Old French Old Russian Persian Polish Portuguese Romanian Russian Sanskrit Scottish Gaelic Serbian Skolt Sami Slovak Slovenian Spanish Swedish Swedish Sign Language Swiss German Tagalog Tamil Telugu Thai Turkish Ukrainian Upper Sorbian Urdu Uyghur Vietnamese Warlpiri Welsh Wolof Yoruba
Availability:
Freely Available
License:
Various
Size:
25 million words Production Status:
Existing-updated
Use:
Parsing and Tagging
-
Paper title:Universal Dependencies v2: An Evergrowing Multilingual Treebank Collection
-
Paper track:Written/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Joakim Nivre | Universal Dependencies | /N |
Documentation:
https://universaldependencies.org
Written
Corpus,
Language Type:
Monolingual
Languages:
Romanian
Availability:
Freely Available
License:
MIT
Size:
26377 entities OtherProduction Status:
Newly created-finished
Use:
Named Entity Recognition
-
Paper title:Introducing RONEC - the Romanian Named Entity Corpus
-
Paper track:Written/poster presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Stefan Dumitrescu | Romanian Named Entity Corpus | /N |
Documentation:
Documentation is publicly available in English.
Multimodal/Multimedia
Corpus,
Language Type:
Monolingual
Languages:
Romanian
Availability:
Freely Available
License:
Creative Commons
Size:
5,600,000 tokens Production Status:
Newly created-in progress
Use:
Corpus Creation/Annotation
-
Paper title:Resources in Underrepresented Languages: Building a Representative Romanian Corpus
-
Paper track:Multimodality/poster presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Ludmila Midrigan - Ciochina | Balanced Corpus of Romanian | /N |
Documentation:
Available documentation per request from authors (lmidrigan@ucdavis.edu and dpcorina@ucdavis.edu)




